Picture for Mengnan Du

Mengnan Du

Law of Neural Interaction: Depth-Width Shape, Interaction Efficiency, and Generalization

Add code
May 27, 2026
Viaarxiv icon

Universal Activation Verbalizer: A Unified Framework for Cross-Model Activation Explanation

Add code
May 25, 2026
Viaarxiv icon

Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs

Add code
Mar 03, 2026
Viaarxiv icon

FinAnchor: Aligned Multi-Model Representations for Financial Prediction

Add code
Feb 24, 2026
Viaarxiv icon

AdaJudge: Adaptive Multi-Perspective Judging for Reward Modeling

Add code
Jan 13, 2026
Viaarxiv icon

NeuronScope: A Multi-Agent Framework for Explaining Polysemantic Neurons in Language Models

Add code
Jan 07, 2026
Viaarxiv icon

Rep2Text: Decoding Full Text from a Single LLM Token Representation

Add code
Nov 09, 2025
Viaarxiv icon

KnowThyself: An Agentic Assistant for LLM Interpretability

Add code
Nov 05, 2025
Viaarxiv icon

AdaptiveK Sparse Autoencoders: Dynamic Sparsity Allocation for Interpretable LLM Representations

Add code
Aug 24, 2025
Viaarxiv icon

Attribution Explanations for Deep Neural Networks: A Theoretical Perspective

Add code
Aug 11, 2025
Viaarxiv icon